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1.  Summary 


The  application  of  computational  mathematics  and  information  science  to  biology  has 
aided  in  the  understanding  of  biological  systems.  Today  biology  can  now  aid  information 
science.  This  research  activity  addresses  this  new  and  potentially  symbiotic  relationship 
between  biology  and  information.  In  this  report  a  biocomputational  analysis  of  a 
biologically  represented  network  is  demonstrated.  A  report  on  new  DNA  aqueous 
laboratory  computing  techniques  is  also  given. 


2.  Introduction 

Classical  group  testing  (CGT)  is  a  widely  used  biotechnical  technique  to  identify  a 
relatively  few  number  of  distinguished  objects  when  the  presence  of  any  one  of  these 
distinguished  objects  in  a  pool  produces  an  observable  result.  This  report  describes  the 
algorithmic  development  of  a  variant  of  the  classical  group  testing  paradigm  called  group 
testing  on  graphs  (GTOG.)  The  difference  between  these  two  abstract  models  is  that 
GTOG  detects  edges  in  a  network  as  opposed  to  vertices.  DNA  laboratory  techniques  and 
statistical  factor  analysis  are  combined  to  identify  covert  structures  in  a  network  with 
edges  represented  by  DNA  duplexes. 

In  the  laboratory,  a  variety  of  methods  have  been  used  to  detect  and  modify  double- 
stranded  DNA  so  that  it  can  be  used  to  perfonn  computations  and  convey  information. 
DNA  strands  have  been  successfully  tagged  with  4  different  small  molecules,  Alexa  fluor 
488,  Bodipy  FL,  digoxigenin  and  biotin  that  enable  molecular  reading.  The  first  two 
molecules  are  fluorescent  and  reading  protocols  employing  antibodies  and  binding 
proteins  that  recognize  these  molecules  have  been  tested.  These  protocols  give  means  of 
tagging  and  reading  DNA  for  aqueous  computations. 


3.  Methods 

3.1  Algorithmic  methods  for  DNA  network 

Throughout  this  paper,  all  simple  lower  case  Roman  variables  are  non-negative  integers. 
Given  set  S  ,  |  S  |  denotes  its  cardinality.  Let  [/?]  =  {1,2 ,...,«}  represent  a  finite  set 
with  n  elements  which  is  called  the  population.  A  k  -set  in  [/?]  is  a  k  element  subset. 
Let  r  =  {Cj,...,Q}  be  an  unknown  collection  of  d  disjoint  edges  in  [//]  .  We  refer  to 
r  as  the  set  of  covert  edges.  In  GTOG  problem,  the  goal  is  to  identify  at  least  one 
member  of  each  covert  edge  in  T  by  performing  certain  0,1  tests  on  subsets  from  [/?] . 
A  test,  P  c:  [n],  is  said  to  be  positive  if  and  only  it  completely  contains  an  edge. 
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In  our  application,  the  graph  consists  of  33  nodes  represented  by  33  distinct  DNA  strands 
designed  (specifically  for  this  proof  of  principle  study)  by  modifying  a  SynDCode 
generated  DNA  code  [1].  A  DNA  code  of  size  n  is  a  collection  of  DNA  strands  with  the 
property  that  no  two  strands  (including  copies  of  identical  strands)  have  a  free  energy  of 
duplex  formation  below  a  certain  threshold.  This  free  energy  threshold  is  selected  to 
ensure  that  no  two  strands  will  hybridize  at  prescribed  experimental  conditions  (e.g., 
temperature,  ionic  concentration.)  A  complemented  DNA  code  of  size  2n  consists  of  n 
complementary  pairs  with  the  property  that  no  two  non-complementary  pairs  (including 
copies  of  identical  strands)  have  a  free  energy  of  duplex  formation  below  a  certain 
threshold.  A  complemented  DNA  code  is  also  known  as  a  DNA  tag-antitag  system.  The 
graph  tested  has  its  vertex  set  the  strands  {A,B,...AI,AJ}-{B,H,AE}  with  its  edges 
depicted  in  Figure  1.  Desired  edges  are  Watson-Crick  duplexes  and  covert  edges  are 
cross-hybridized  duplexes. 


A=  AGGCTAAAGTTATCAC _ S  =  GTGATAACTTTAGCCT 

C=  GTCTTCGTTTTTTTCA - U  =  TGAAAAAAACGAAGAC 

OD=  GCAAGCGACCAATACT  ,V  =  AGTATTGGTCGCTTGC 

E=  TACCTTTTCTCGACGC _ J-W  =  GCGTCGAGAAAAGGTA 

F=  CTCAATAAAATGCGCG - l~X  =  CGCGCATTTTATTGAG 

OH=  CGTTGCACTCAAGATC  /  Z  =  GATCTTGAGTGCAACG 

1=  GACTGGAATGTTTTGT - =  ACAAAACATTCCAGTC  complemented 

J=  GGATGCAGGTTGATTA - AB  =  TAATCAACCTGCATCC  DNA  Code 

K=  AAGCCTTAGAAGAGAG - 7 A AC  =  CTCTCTTCTAAGGCTT 

L=  TTTCTGTGGCACTGGT _ ]-/- _ AD  =  ACCAGTGCCACAGAAA 

M=  TGTGTGTCCGATGAGAn  1/  OAE  =  TCTCATCGGACACACA 

N=  TTAAAAGACGTTGGTTA; - A - AF  =  AACCAACGTCTTTTAA 

0=  TACGCTAATCGGTAAG - AG  =  CTTACCGATTAGCGTA 

Q=  TGGAGGAACTACCGGA - AI  =  TCCGGTAGTTCCTCCA 

R=  CCATAGCTGAGTTCTT  //  \ - AJ  =  AAGAACTCAGCTATGG 

/  /  N  - 


B=  CGTTGCcCTCAAGATC  f - T  =  GATCTTGAGgGCAACG 

G=  GCAAGCGtCCAATACT - \Y  =  AGTATTGGaCGCTTGC 

P=  TGTGTGaCCGATGAGA - AH  =  TCTCATCGGtCACACA 


strands  modified 
to  cross-hybrid  ize 


Figure  1:  DNA  Network 

The  strands  (B,G,P,T,Y  and  AH)  excluded)  were  generated  by  SynDCode  [1].  In 
this  complemented  DNA  code,  no  non-complementary  pair  has  a  AG  of  duplex 
formation  less  than  -8KCal/Mol  at  37C.  In  addition,  no  non-complementary  pair 
has  a  secondary  structure  with  more  than  five  2-stems,  or  three  3 -stems,  or  two  4- 
stems.  Moreover,  since  these  conditions  hold  for  potential  interactions  between 
identical  strands,  these  thresholds  also  hold  for  all  possible  single  stranded 
hairpin  structures.  Strands  D,  H,  and  AE  were  deleted  from  the  collection.  With 
these  stands  deleted,  exactly  three  CH  duplexes  exist  at  37C,  namely  B:Z,  G:V, 
and  M:AH.  Proper  edges  are  indicated  by  solid  lines  and  covert  edges  by  dashed 
lines. 
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Edge  tests  for  this  for  this  graph  consisted  of  a  collection  of  strands  that  did  not  contain  a 
pair  of  complementary  strands  (i.e.,  the  desired  edges).  Thus  each  edge  test  corresponds 
to  an  independent  set  in  the  desired  (or  assumed)  graph.  This  edge  test  was  then  treated 
with  Sybr  Green  I.  This  nonsymmetrical  cyanine  dye  is  known  to  produce  exponentially 
greater  fluorescence  when  bound  to  duplex  DNA  as  opposed  to  single-stranded  DNA. 
The  pools  were  constructed  in  a  pseudo-random  method  that  is  a  heuristic  modification  to 
a  purely  random  method  that  can  be  applied  to  standard  and  non-complemented  DNA 
codes. 

3.2  Laboratory  method  for  labeling  DNA 

DNA  labeling  was  accomplished  primarily  using  different  types  of  polymerase  chain 
reaction  (PCR)  conditions.  In  general,  the  multiple  cloning  region  of  pBluescript  II 
plasmid  with  the  primers  from  the  -20  and  reverse  primer  from  M13  was  used  in  PCR 
which  amplifies  the  multiple  cloning  region  of  this  plasmid  to  form  a  ~250bp  DNA 
fragment. 


Oligonucleotide 

Sequence 

5'PCRT1T2L 

TTGTAAAACGACGGCCAGTGGATCC  CCAAACCTCCACTTT  CCAAC  ACACAACTCC 

5'PCRT  1 F2L 

TT GT AAAACGACGGCCAGTGGAT CC  CCAAACCTCCACTTT  CCAACCCTACACCAC 

5'PCRF1T2L 

TTGTAAAACGACGGCCAGTGGATCC  CAACCAACCACT CTA  CCAACACACAACTCC 

5'PCRF1  F2L 

TTGTAAAACGACGGCCAGTGGATCC  CAACCAACCACTCTA  CCAACCCTACACCAC 

C(T  1 ) 

GTTGGAAAGT  GGAGGTTTGG 

C(F  1 ) 

GTTGGTAGAG  TGGTTGGTTG 

T2RT3L 

TCCACAATCA  CCTTTCCTCC 

T2RF3L 

TCCACAATCA TCACACACAC 

F2RT3L 

CTACACCTTT  CCTTTCCTCC 

F2RF3L 

CTACACCTTT TCACACACAC 

C(T2) 

TGATTGTGGA  GGAGTTGTGT 

C(F2) 

AAAGGTGTAG  GTGGTGTAGG 

T3RT4L 

ATCACCTCAT  CCTCACTCTC 

T3RF4L 

ATCACCTCAT  CACCTCTCTC 

F3RT4L 

ACACACAATT  CCTCACTCTC 

F3RF4L 

ACACACAATT  CACCTCTCTC 

C(F3) 

AAT  TGT  GTG  TGT  GTG  T  GTG  A 

C(T3) 

ATGAGGTGAT  GGAGGAAAGG 

T4RT5L 

ACTTCCTTCA  TCTCCTCTCC 

T4RF4L 

ACTTCCTTCA  TTTCCACCAC 

F4RT5L 

ACTTCTTCCA  TCTCCTCTCC 

F4RF5L 

ACTTCTTCCA  TTTCCACCAC 

C(T4) 

TGAAGGAAGT  GAGAGTGAGG 

C(F4) 

TGGAAGAAGT  GAGAGAGGTG 

T5R3'PCR 

ACT CAAAACC  A  AGC  TTC  ATG  GTC  ATA  GCT  GTT  TCC 

F5R3'PCR 

CT CAACACAT  A  AGC  TTC  ATG  GTC  ATA  GCT  GTT  TCC 
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C(T5)3’PCR-BIOTIN 

/5Bio/GGAAACAGCTATGACCAT GAAGCTTGGTTTT GAGT  GGAGAGGAGA 

C(F5)3'PCR-BIOTIN 

/5Bio/GGAAACAGCTAT GACCAT GAAGCTT  ATGTGTTGAG  GTGGTGGAAA 

5'PCR  (M13-UP  BamHI) 

TT  GT  AAAACGACGGCCAGT  GGAT  CC 

3'PCR  (M13-RP  Hindlll) 

GGAAACAGCTATGACCATGAAGCTT 

AF5'PCR  (M13-UP  BamHI) 

/5Alex488N/TTGTAAAACGACGGCCAGTGGATCC 

Biotin3'PCR(M13-RPHindlll) 

/5Bio/GGAAACAGCTAT  GACCAT  GAAGCTT 

Figure  2:  Oligonucleotide  sequences  for  construction  of  DNA  library 

3.3  Laboratory  methods  for  DNA  network  pooling 

Pools  consisted  of  0.5  pL  of  each  DNA  strand,  5  pL  25  mM  MgCf>.  5  pL  reserved  for 
SYBR  Green  I  master  mix  distilled  deionized  H2O  to  ensure  the  final  volume  of  pool  is  50 
pL.  The  ddH20  was  added  to  tubule,  then  MgC12,  and  each  strand  that  would  be  included. 
The  tubule  was  centrifuged,  placed  in  a  PCR  machine  for  the  program  listed  below, 
SYBR  master  mix  was  then  added,  and  tubules  were  agitated,  centrifuged,  and  then 
transferred  to  96-well  plate.  The  mix  was  run  in  a  Real  Time  PCR  thermocycler  over  a 
temperature  range  of  30-60  °C.  The  program  typically  took  28  min.  The  covert  edges 
were  then  detected  by  increase  in  fluorescence. 

3.4  Laboratory  methods  for  DNA  Library  construction  and  analysis 

From  DNA  sequences  designed  by  SynDCode,  several  DNA  oligonucleotides  were  used 
to  create  a  library  using  parallel  overlap  assembly  [2,3].  The  DNA  sequences  are  given 
in  Figure  2. 

These  sequences  were  initially  mixed  as  follows.  In  one  tube,  0.5nmoles  of 
5TCRT1T2L,  5’PCR  F1T2L,  5TCRT1F2L,  5TCRF1F2L,  T2RF3L  ,  T2RT3L, 
F2RF3L,  F2RT3L  with  1  nmole  of  C(F2),  C(T2),  C(T1),  and  C(F1)  were  combined.  In 
another  tube,  0.5nmoles  of  F4RF5L,  T4RT5L,  F3RF4L,  T4RF4L,  F3RT4L,  F4RT5L, 
T3RF4L,  T3RT4L  with  Inmole  of  C(F3),  C(T3),  C(F4),  and  C(T4)  were  combined. 
These  tubes  were  heated  in  a  boiling  water  bath  for  10  minutes  and  then  allowed  to  cool 
slowly  to  room  temperature.  These  annealed  oligonucleotides  were  treated  with 
polynucleotide  kinase  and  then  ligated  for  15  minutes  at  room  temperature.  Next,  a 
portion  of  the  second  tube  was  combined  with  a  tube  containing  Inmole  of  T5R3'PCR, 
F5R3’PCR,  C(T5)3 'PCR-BIOTIN,  C(F5)3’PCR-BIOTIN  oligonucleotides  that  had  been 
annealed  and  treated  with  polynucleotide  kinase.  These  combined  oligonucleotides  were 
ligated  and  then  combined  with  the  first  tube  and  ligated.  Portions  of  these  samples  were 
then  amplified  with  PCR  using  the  5’PCR  and  3 ’PCR  primers  and  analyzed  on 
polyacrydamide  gels.  For  cloning,  we  cut  the  PCR  products  with  BamHI  and  Hindlll  and 
ligated  the  fragments  into  a  similarly  cut  pBluescript  SKII  vector  and  transformed  into  E. 
coli  cells.  Isolated  plasmids  were  sequenced  using  automated  sequencing  reactions  and 
separated  on  an  ABI  Model  310  DNA  sequencer. 
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4.  Results  and  Discussion 


4.1  Random  design  model 

Let  0</?<l.  A  random  pool  (or  row  of  a  random  matrix)  in  [77]  can  be  described  as 
an  77  -sequence  of  i.i.d  Bernoulli  trials  X  =  X1,...,Xn  =  (Xj)  where  each  Xj  is  1 
with  probability  p  . 

Throughout  this  paper  p  is  reserved  and  assumed  to  be  p  =  Pr  ob(XL  /  =  1). 

Then  given  n,p,k,d  and  a  fixed  T  =  {C1?...,Crf}  as  above,  we  define  the  binary 
random  variable  Y  to  be  1  if  the  pool  X  is  positive.  Given  N  random  pools,  then 
for  each  i  in  1  <  i  <  N,  then  A  =  (Xt  J )  where  Xt  ■  is  the  ijth  entry  of  the 
random  matrix.  With  Yi  corresponding  to  X  ,  we  have  the  sequence 
X  =  Yi,...,Yn  =  (Yi)  where  the  random  variable  Yt  is  1  if  and  only  if  the  pool  X  is 
positive.  In  other  words, 

Y,=  1  <=>  there  is  Cl  =  {jx,...,jk}  e  T  such  that  Xi  jx  = ...  =  XiJk  =  1 
Clearly,  Yt  s  are  i.i.d.  It  is  straightforward  to  verify  that  for  a  given  Y  =  {C^,...,Cd}  as 
described  above  that  Pr  ob(Yt  =  0)  =  (1  -  pk  )d  .  We  call  Y  the  output  vector.  See  Figure 


Figure  3:  A  random  design 

Here  k=2,  d=3  and  T=  { { 1 ,2} ,  { 1 2, 1 3 } ,  {24,25 } } .  Pool  i 
is  positive  if  and  only  if  row  i  has  Is  in  columns  1  and 
2,  columns  12  and  13,  or  columns  24  and  25. 
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4.2  Interpreting  output 


A  combination  of  the  random  matrix  (Xtj)  and  its  associated  output  vector  ( Yi )  is 

used  to  define  a  new  matrix  ( Ztj ).  Forgiven  n,p,k,d  ,  T  =  {C1;...,C^},  and  an 

instance  (X,-  ■)  as  described  above,  suppose  we  have  the  resulting  (A, ).  For  each 

(/,/)  eJVx  [»],  we  define  the  random  variable  Zi>y-  =  Xtj  +  Yt  + 1  (mod2).  For  each 

7<e[/z],  the  sequence  Z  =  Zlj,...,ZNj  =  (Z^j)  consists  of  i.i.d  Bernoulli  variables. 

However,  for  /,  ^  j2  it  is  not  always  the  case  that  Zi  j  and  Zt>j  are  independent 

or  identically  distributed  and  it  is  this  difference  that  we  exploit.  Clearly  the  distribution 
of  Z-  ■  depends  on  the  given  j .  Consider  an  instance  of  a  random  matrix  ( X t  ■ )  as 

j  column  vectors,  then  the  matrix  (Zl  ■ )  results  from  adding  the  complement  of 

output  vector  F  to  each  column  of  (X l  ■ ).  The  binary  matrix  is  constructed  from 
(Xj  j  )  and  Y  and  shown  in  Figure  4. 


Figure  4:  Adjusted  random  matrix  (Z}  j ) 


Matrix  (Z;-  j  )  constructed  from  (X  j  j )  and  Y .  The  weights  of 
the  columns  are  given  at  the  bottom.  If  the  threshold  is  t  =  0.63, 
then  [ Nt~\  -  19.  With  this  value  of  t,  the  columns  taken  are 
j  =  1,2,13,23,24.  Notice  that  all  but  j  =  23  is  an  element  of  a 
covert  edge  and  each  covert  edge  has  been  hit  at  least  once.  Also  the 
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values  of  p  =  .467  and  t  =  .63  where  chosen  to  minimize  N  when 
d 

50%  of  the  U  Cj  is  expected  to  be  identified  and  the  error  rate 
1=1 

(Pr  ob(T  |  G))  is  0.04. 


N 

For  each  j  e[n\,  we  define  FNj  =  ^dZij.  Then  for  each  j  e[n],  FN  j  gives  the 

t=i 

number  of  l.v  in,  or  the  weight  or  frequency  of,  the  jth  column  the  in  matrix  {Zi  ■ ). 
Given  a  N  x  n  matrix  (Zjj)  and  an  a  threshold  t,  take,  as  the  objects  of  the  search, 
all  columns  of  (Z,  ■)  whose  weight  is  at  least  Nt  .  See  Figure  4.  In  Section  4.4,  the 

method  of  choosing  the  values  of  p  and  t  so  as  to  minimize  the  number  of  pools  N  is 
given. 


4.3  Distributions,  parameters  and  design  functions 

For  n,k,d  and  T  =  {Cj,..., Cd)  as  described  above,  let 


d 

Bad  =  U  Ci 


and 


/= l 

Good  =  [n\-  Bad  .  For  j  eGood  ,  let  pJg  be  the  probability  that  Z;  y  =  1  and  let 
pJb  be  the  probability  that  Z;  ■  =  1  if  Bad.  It  is  straightforward  to  verify  that 
pi'  =  pJg2  if  j\,J2  e  Good  and  because  T  is  a  disjoint  family  we  have  that 
Pb  =  P  'b  ^  yj  ,  /2  e  Bad.  Thus  we  let  pb  and  pg  respectively  denote  pb  and 

pJg.  Below  we  compute  the  values  of  pb  and  pg.  These  values  are  different  and  it 
is  this  difference  that  allows  us  to  distinguish  between  the  Good  and  the  Bad  . 


For  j  e  Bad  ,  we  let  b0  0  be  the  probability  that  Xi  j=  0  and  Yl  =  0  and  bu  be 
the  probability  that  XLj  =  1  and  Yt=  \  .  Then  pb  =b00+bll.  For  j  eGood  ,  we 
let  g0  o  be  the  probability  that  Xi  j  =  0  and  Yt  =  0  and  gL  |  be  the  probability  that 
XLj  =  1  and  Yt=\  .  Then  pg  =  g0,o  +  Su-  F°r  j  ^  Bad  ,  we  let  b]  ()  and  bQ  ] 
respectively  be  Proh(X!  i  =l  ,  Yt  =  0)  and  Yrob(Xi  j  =  0  ,  Yt  =  1).  Similarly,  for 
j  eGood  ,  let  gl0  and  g01  respectively  be  Yrob(Xi  j  =  \  ,  Yt  =  0)  and 

Yvob(Xi  j  =  0  ,  Yi=\).  As  was  noted  in  Section  Pro/>(Ij  =  0)  =  (l-pk )d .  It  is 
straightforward  to  see  that  since  Y  is  a  disjoint  family  that  h{)  ()  =  (1  -  p)(\  -  pk  )d  1 
and  g0y  =(^~  P)(^~  pk)d  ■  Thus  pb- pg=2{\- p)pk{\- pk)d~\ 
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For  each  je  Bad,  Good,  we  have  that  FN  j  =b(N,pb )  ,  b(N,pg )  respectively 
where  b(N,p )  is  the  binomial  distribution.  Let  FN  j=F ^  ,  Fy  for  j  e  Bad , 

Good  respectively.  Values  for  p  and  a  threshold  t  that  discriminate  between  objects 
in  Bad  and  Good  are  desired.  Using  the  normal  approximation  to  the  binomial,  we 
can  optimize  in  the  following  way.  Assume  Fy  is  normal  with  ph  =  Apfe  and 
<jh  =  y/  A/;,,  (1  -  ph )  and  F|  is  nonnal  with  //„  =  Npg  and  ag  =  ^ Npg(l-pg  ). 
Let  t  denote  a  frequency  threshold.  Let  zh(N,t),  zg(N,t )  be  the  z-score  of  At 

under  Fy ,  Fy  respectively.  Thinking  of  zh(N,t)  and  zg{N,t )  as  parameters,  we 
express  A  and  t  as  a  function  of  p  .  We  have: 


zb(N,t)  = 


z  (N,t)  = 


=  Va- 


Nt-Nph 
^Nph(l-ph)  V Ph(X-Pb ) 


.  =  w-  '"p* 


■jNpg(l-pg)  JpF-P  i-) 


Combining  these: 


(A, t)jph (1  _  zg OjPg (l~Pg) 

t~Pb  t-Pg 


and 


Finally, 


*Cp)  = 


PgH (# , t)^ph(\-  ph)  -  phzg  (A, t)Jpg(l-pg) 
zb  (N>  O^PbQ-Pb)  -  zg  (N’  0 -JPgV-Pg) 

zb(N,t)^pb(\-Pb)'' 


N(p)  = 


Kp)-pb 


4.4  Optimization  methods 

Given  n,p,k,d  and  T  =  {Ci,...,Cd}  as  above  and  values  of  zh  ^zh(W(}i)t(}ii 

and 

zg  =  zg  ( N(p),t(p ))  ,  we  define  the  following  events  for  each  j  e  [//] :  G  is  the  event 
that  j  €  Good  ,  B  is  the  event  that  j  e  Bad  and  T  is  the  event  that 
-^(fWf)]  We  refer  to  T  as  the  element  j  attaining  the  threshold.  We 

have  that  Pr ob(B)  =  ^-,  Prob(G)  =  1--^-,  Prob(T  |  F)  =  Prob(A(0,l)  >  zft)  and 
Pr  ob(T  |  G)  =  Pr  ob{  A(0, 1)  >  zg ).  Then  the  probability  that  an  element  j  is  a  member 
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of  some  complex  given  that  it  has  attained  the  threshold  is: 

Pr  ob(T  |  B)  Pr  ob(B) 


Vxob(B  |  T)  = 


Pr  ob(T  |  B )  Pr  ob(B)  +  Pr  ob(T  \  G )  Pr  ob(G ) 


_ ?rob(N(0,l)>zb)f _ 

Pr  ob(N(  0, 1)  >  zb  ) ^  +  Pr  ob(iV(0, 1)  >  zg  )(1  -  f) 


- Pr0i('V(°-1)-Z'’) - when  „  □  k,d 

Prob(iV(0,l)  >  zb)  +  yyVxob(N (0,1)  >  zg ) 


The  expected  number  of  objects  identified  in  a  given  complex  is 
kVxob(T  |  B )  =  A;Prob(jV(0,l)  >  zb ).  The  expected  number  of  objects  misidentified  is 

(n  -  kd)Fx ob(T  |  G)  =  (n  -  kd)Vx  ob(N (0,1)  >  zg).  We  consider  six  special  cases: 

1 .  Suppose  that,  on  average,  1  object  of  each  complex  is  to  be  identified.  In  this  case, 
Prob(iV(0,l)  >  zb)  =  j-  and  Pr  ob(B  I  T ) - - - .  Thus  to  be  sure  that 

V  V  ’  b’  k  vi/  l+iVpro6(7V(0,l)>zg) 

0.5,  jy  and  yy  of  the  objects  attaining  the  threshold  are  actually  members  of  a 
complex,  then  Prob(iV(0,l)  >  zg)  must  be  y  ,  yy  and  y  respectively. 


2.  Suppose  that,  on  average,  half  of  each  complex  is  to  be  identified.  In  this  case, 
Prob(/V(0,l) >  zb)  =  \  and  Pr ob(B  IT) - ? - - -  .  Thus  to  be  sure  0.5, 

V  V  ’  7  b>  2  VI/  \+^7rob{N{0,\)>zg) 


k 

k+ 1 


and  yy j-  of  the  objects  that  attain  the  threshold  are  actually  members  of  a  complex, 
Prob(iV(0,l)  >  z  )  must  be  ,  -tj-  and  respectively. 

o  Zn  Zn  Zn 
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Figure  5:  Graph  of  N(p) 

In  Figure  5,  the  graphs  of  N(p)  are  displayed.  In  each  case  zb  =-0.253  and 
zg  =3.090.  Thus  the  Vvob(T  \  B)  =  0.60  and  Vvob(T  \  G)  =  0.001.  When  k  =  2 
and  d  =  5  ,  the  minimum  value  of  N( p)  =  303  is  attained  when  p  =  0.384  .  For  this 
value  of  p  we  have  that  that  0.488  =  pg  <  t(p)  <  pb  =  0.584  and  t{p)  =  0.577 

Recall  that  ^(.384)  ~  h(303, 0.584)  and  384)  ~  b(303, 0.488).  Thus  for  our  optimal 
values,  we  have: 

Pr  obiT  |  B )  =  Proh(F^(0384)  >  [jV(0.384)f(0.384)]  =  175)  =  Proh(7V(0,l)  >  -0.253)  =  0.60 
and 

Pr  obiT  |  G)  =  Pr  oh(F|(  384)  >  [iV(.384)t(.384)]=  175)  =  Proh(iV(0,l)  >  3.090)  =  .001. 

cl=5 

What  this  indicates  is  that  if  j  e  Bad  =  U  Q  ,  then  the  probability,  Prob(F303  ■  >  175) 

/= l 

=  0.6  and  on  average,  0.6  |  C/  |=1.2.  Members  of  each  complex  are  identified  by 

d=  5 

attaining  the  threshold  of  175.  On  the  other  hand,  for  j  e  Good  =  [/?]-  U  Q  ,  we  have 

/= l 

that  the  Proh(  F303  y  >175)  =  0.001  .  When  k  =  3  and  <7  =  5  ,  the  minimum  value  of 
7V(/?)  =  515  is  attained  when  p  =  0.517.  For  this  value  of  p  we  have  that  that 
0.501  =  Pcr<  t(p)  <  pb  =0.574  and  t(p)  =  0.569  .Since  k  =  3  here,  we  have  on 

average,  0. 6|C/|H  1 . 8  members  of  each  complex  identified  by  attaining  the  threshold  of 
296.  In  both  cases  k  =  2,3  and  d  =  5  ,  if  »  =  1000  ,  an  average  number  of  ~1 

member  of  Good  attains  the  respective  threshold  and  is  misidentified  as  a  member  of  a 


10 


complex.  Using  the  equation  for  Prob(B  |  T),  we  obtain  Pr  ob(B  \  T) - =  q^1+{  ■ 

°'6+w 

Thus  for  k  =  2,  d=  5,  we  have  that  Vvob(B  \  T)  =  0.857  and  when  k  =  3  and 
d=  5,  Vrob(B  \  T)  =  0.900. 

In  general,  given  n,k,d  ,  zb  and  zg  ,  where  zb  is  chosen  to  so  that,  on  average,  a 
certain  proportion  of  each  complex  identified  and  z ,,  is  chosen  to  ensure  a  certain 
degree  of  accuracy  given  by  the  conditional  probability,  we  find  the  value  of  p  that 
minimizes  N  .  Then  we  use  this  optimal  value  of  p  to  find  the  value  for  the  threshold 
t. 

4.5  Results  for  the  random  application  to  the  DNA  network 

The  pools  were  constructed  in  a  pseudo-random  method  that  is  a  heuristic  modification  to 
a  purely  random  method  that  can  be  applied  to  standard  and  non-complemented  DNA 
codes.  A  portion  of  the  abstract  random  design  is  depicted  in  Figure  6.  Figure  7  indicates 
what  the  theoretical  and  observed  output  was  from  the  Real-Time  PCR  Thennocycler. 
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Figure  6:  DNA  Network  pools  1-18 


Pool  i  consists  of  all  strands  whose  column  has  a  1  in  the  ith  row.  Sixty  pools  in  all  were  used. 
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Figure  7:  Sample  outcome  of  DNA  pool  fluorescence 

Both  the  theoretical  and  observed  outcomes  for  a  sample  of  pools 
are  given.  Note  only  two  pools  out  of  sixty  where  in  error. 


A  random  design  of  sixty  pools  was  used.  In  Figure  8,  the  column  frequencies  are  given. 
The  cutoff  was  35.  Thus  the  vertices  of  the  covert  edges  were  identified  with  100% 
accuracy. 

29  40  34  29  31  38  28  28  26  29  40  30  34  17  39  29  31  20  26  39  31  29  22  36  32  32  34  31  30  26  43  21  31 
ABCEFGI  J  KL  MNOPQRST  UVWXYZAAAB  AC  AD  AF  AG  AH  Al  AJ 

Figure  8:  DNA  network  test  frequencies 


N=60  pools.  The  threshold  is  t  =  0.58,  then  |”m"|  =  35.  With 
this  value  of  t,  the  columns  taken  are  marked  with  *. 


4.6  DNA  labeling  results 

Labeling  of  the  DNA  was  accomplished  by  using  biotinylated  or  Bodipy-FL  modified 
reverse  primer,  Alexa-fluor  modified  -20  primer,  or  by  using  labeled  nucleotides 
modified  with  either  biotin  or  digoxigenin.  These  labeled  and  unlabeled  primers  were 
obtained  from  Integrated  DNA  Technologies.  The  amplified  DNA  was  purified  using 
agarose  gel  electrophoresis  and/or  the  PCR  extraction  kit  (Qiagen).  Detection  of  the 
fluorescently  labeled  DNAs  was  done  on  a  UV  transilluminator  directly  after  separation 
without  further  staining.  Alternatively,  the  gel  was  transferred  to  a  nylon  membrane  and 
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probed  with  specific  antibodies.  We  also  performed  dot  blots  to  confirm  the  presence  (or 
absence)  of  tagged  DNA  using  the  antibodies.  In  this  case,  0.5-lul  of  DNA  solutions 
plus  controls  were  dotted  on  nylon  membrane,  treated  with  UV  then  probed  with  the 
antibodies  to  one  of  the  specific  labels.  The  antibodies  were  generally  detected  with 
secondary  antibodies  linked  to  horse  radish  peroxidase  using  a  chemiluminescent 
substrate  (Pierce).  For  the  separation  of  the  specifically  labeled  DNAs  in  solution,  we 
incubated  the  antibody  with  anti-antibody  linked  agarose  or  magnetic  beads,  removed  the 
unbound  antibody,  then  added  appropriate  amounts  of  the  labeled  DNA.  These  beads 
contained  a  complex  of  tagged  DNA  and  anti-tag  antibodies.  The  anti-antibody  beads 
were  then  washed  and  the  DNA  and  anti-tag  antibody  are  removed  using  boiling  water. 
The  presence  of  the  tagged-DNA  in  each  fraction  was  confirmed  using  gel  separation  or 
dot  blots  and  then  anti-tag  antibodies  followed  by  anti-antibodies  and  the 
chemiluminescent  substrate.  Light  from  the  substrate  was  detected  by  X-ray  film  and 
then  scanned. 

4.7  Results  of  DNA  library  construction  and  analysis 

These  sequences  were  initially  mixed  as  follows.  In  one  tube,  0.5nmoles  of 
5TCRT1T2L,  5’PCR  F1T2L,  5TCRT1F2L,  5TCRF1F2L,  T2RF3L  ,  T2RT3L, 
F2RF3L,  F2RT3L  with  1  nmole  of  C(F2),  C(T2),  C(T1),  and  C(F1)  were  combined.  In 
another  tube,  0.5nmoles  of  F4RF5L,  T4RT5L,  F3RF4L,  T4RF4L,  F3RT4L,  F4RT5L, 
T3RF4L,  T3RT4L  with  Inmole  of  C(F3),  C(T3),  C(F4),  and  C(T4)  were  combined. 
These  tubes  were  heated  in  a  boiling  water  bath  for  10  minutes,  then  allowed  to  cool 
slowly  to  room  temperature.  These  annealed  oligonucleotides  were  treated  with 
polynucleotide  kinase  and  then  ligated  for  15  minutes  at  room  temperature.  Next,  a 
portion  of  the  second  tube  was  combined  with  a  tube  containing  Inmole  of  T5R3'PCR, 
F5R3TCR,  C(T5)3’PCR-BIOTIN,  C(F5)3’PCR-BIOTIN  oligonucleotides  that  had  been 
annealed  and  treated  with  polynucleotide  kinase.  These  combined  oligonucleotides  were 
ligated  and  then  combined  with  the  first  tube  and  ligated.  Portions  of  these  samples  were 
then  amplified  with  PCR  using  the  5’PCR  and  3’PCR  primers  and  analyzed  on 
polyacrydamide  gels.  For  cloning,  PCR  products  were  cut  with  BamHI  and  Hindlll  and 
ligated  the  resulting  fragments  were  then  ligated  into  a  similarly  cut  pBluescript  SKII 
vector  and  transformed  into  E.  coli  cells.  Isolated  plasmids  were  sequenced  using 
automated  sequencing  reactions  and  separated  on  an  ABI  Model  310  DNA  sequencer. 

4.8  Results  of  labeling  and  detection  of  tagged  DNA 

DNA  strands  were  tagged  with  different  modified  bases  so  that  they  could  be  read  by 
antibodies.  PCR  primers  that  had  a  reactive  amine  group  incorporated  into  one  of  the 
bases  were  reacted  with  succimidyl  containing  Alex-fluor  or  Bodipy  compounds  to  form 
labeled  primers.  This  reaction  was  successful  in  producing  fluorescently  labeled  PCR 
products  (Figure  9). 


13 


LI  2  3  4  5  6  LI  2  3  4  5  6 


A. 

Figure  9:  Detection  of  fluorescently  labeled  PCR  products 

PCR  made  with  the  Alexa  fluor  labeled  -20  primer  (5  and  15ul,  lanes  1  and  2, 
respectively),  r  the  amino-modified  with  Alexa  fluor  -20  primer  (5  and  15ul,  lanes  3  and 
4,  respectively),  and  the  amino-modified  with  Bodipy-FL  reverse  primer  (5  and  15ul, 
lanes  5  and  6,  respectively)  were  separated  on  a  10%  polyacrydamide  gel  with  a  low 
molecular  weight  DNA  marker  (L).  The  DNA  was  visualized  by  simply  shining  UV 
light  (so  direct  fluorescent  bands)  (panel  A)  or  through  staining  of  total  DNA  using 
ethidium  bromide  (panel  B).  Fluorescently  labeled  DNA  with  the  differently  labeled 
primers  and  both  types  of  fluorescent  molecules  were  observed. 

Antibodies  for  different  tags  were  tested  for  their  ability  to  detect  the  labeled  fragments 
and  not  any  of  the  other  tags.  High  specificity  without  cross-reactivity  was  confirmed  by 
dot  blots,  loading  all  of  the  differently  labeled  DNA  fragments  and  probing  with  antibody 
to  Alexa  fluor,  Bodipy-FL  or  digoxigenin  or  with  strepavidin  that  recognizes  biotin. 

PCR  products  labeled  with  Alexa-fluor  (AF),  Bodipy-FL  (BO),  digoxigenin  (DIG)  or 
biotin  were  dotted  on  a  nylon  membrane  and  then  probed  with  specific  antibodies  to 
Alexa-fluor  (far  left  blot),  Bodipy  (middle  blot)  or  to  digoxigenin  (far  right  blot). 
Detection  of  antibody  binding  was  done  through  anti-rabbit  antibodies  labeled  with 
alkaline  phosphatase  and  then  colorimetric  detection  of  the  enzyme.  Dark  spots  indicate 
enzyme  detection  of  antibody  binding.  Thus,  the  antibodies  detect  only  the  PCR  products 
labeled  with  the  specific  molecule  that  is  their  antigen. 

4.9  Isolation  of  tagged  DNA  using  antibody  binding 

In  order  to  use  these  tagged  DNA  fragments  in  a  computation,  there  is  a  need  to  separate 
DNA  that  was  tagged  from  unmodified  DNA.  The  general  scheme  is  shown  in  a  diagram 
in  Figure  10.  For  this,  antibodies  to  those  tags  bound  to  beads  were  used.  Protein  A 
agarose  beads  that  bind  many  types  of  antibodies  including  those  made  against  Alexa 
fluor,  Bodipy  and  digoxigenin  were  used  as  were  magnetic  beads  modified  with  anti- 


14 


rabbit  antibodies  that  bind  as  well  to  the  antibodies  to  these  three  tags.  For  the  biotin, 
strepavidin  linked  magnetic  beads  that  very  effectively  bind  biotinylated  DNA  were  used. 


equences  that  contain  either 
Tagl  or  Tag2  : 

Tag  1  DNA,  Tag2DNA, 
aglTag2DNA _ 


Figure  10:  Binding  reactions  with  differently  tagged  DNA  strands 

DNA  labeled  with  one  or  more  tags  or  unlabeled  can  be  separated  and  put  into  mixtures 
in  the  presence  of  antibodies  for  the  different  tags.  Binding  of  these  antibodies  and  then 
subsequent  separation  of  the  bound  material  using  beads  linked  with  the  antibodies  will 
separate  out  the  tagged  DNAs.  This  can  be  used  in  a  computation  to  isolate  DNA  that 
satisfies  the  clause  Tagl  OR  Tag2  (p  OR  q). 

For  these  reactions,  PCR  products  labeled  using  one  primer  with  Alexa-fluor  and  the 
other  primer  labeled  with  biotin  were  used  with  one  of  them  bound  the  DNA  with 
antibodies  to  the  Alexa-fluor  linked  to  magnetic  beads  with  anti-antibodies  or  with 
strepavidin  linked  magnetic  beads.  The  unbound  material  was  then  removed  after 
incubation,  the  beads  washed  and  the  bound  material  isolated  by  boiling  the  beads. 
Detection  of  the  tags  in  the  bound  and  unbound  material  was  done  by  either  dot  blots  or 
by  first  separating  the  DNA  on  a  gel,  and  transfer  to  a  membrane.  In  both  cases,  the  tags 
were  detected  with  antibody  to  Alexa-fluor  or  with  strepavidin  that  binds  to  biotin. 
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Figure  11:  Separation  of  labeled  DNA  using  antibodies 

DNA  labeled  with  Alexa-fluor  and  biotin  (PCR)  was  incubated  with  Alexa-fluor 
antibodies  and  the  bound  (AF  bound)  and  unbound  material  (AF  unbound)  was  separated 
on  a  gel.  This  DNA  was  also  incubated  with  strepavidin  and  the  bound  (strep  bound)  and 
unbound  material  (strep  unbound)  was  separated  on  the  same  gel.  See  Figure  11,  above. 
Detection  using  the  anti-Alexa  fluor  antibody,  panel  A,  and  streptavidin,  panel  B,  was 
conducted.  Labeled  DNA  is  detected  in  the  bound  and  some  in  the  unbound  fractions. 
The  bands  that  are  higher  in  the  gel  than  the  PCR  product  is  likely  to  be  antibody  that 
may  be  linked  to  DNA.  In  panel  C,  different  amounts  of  anti-Alex  fluor  antibody  (0.5  or 
1.0  ul)  was  used  to  see  if  that  changed  the  binding  reaction.  Then  the  bound  (B)  and 
unbound  (U)  material  was  dotted  onto  nylon  membrane  and  biotin  detected  with 
strepavidin.  In  both  cases,  there  was  no  tag  detected  in  the  unbound  fraction  but  it 
appeared  that  lower  amounts  of  antibody  could  bind  more  PCR  product  to  the  beads. 
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4.10  Construction  of  the  DNA  library 


A  5-site,  2-variable  library  of  sequences  using  codes  designed  by  SynDCode  was 
constructed  by  combining  the  oligonucleotides  using  a  parallel  overlap  assembly 
approach.  We  combined  the  oligonucleotides  in  3  stages  and  monitored  the  appearance  of 
appropriate  bands  during  the  assembly  using  polyacrydamide  gel  electrophoresis  (Figure 
12). 

1  2  3  4  LMW  5  6  LMW 


+  i 

L, 


Figure  12:  Library  preparation,  amplification  and  digestion 

A  ~175bp  band  appeared  in  the  final  ligation  step  as  expected  (Figurell, 
lanes  2  and  4).  PCR  amplification  of  these  mixtures  using  primers  at  the 
ends  of  the  library  sequences,  produced  the  appropriate  sized  products 
(Figure  11,  lane  6). 

Fragments  representing  sites  1-2  were  combined  and  ligated  and  produced  primarily 
fragments  at  75bp  as  expected  (lanes  3  and  4)  while  fragments  representing  sites  3-4 
(lane  1)  and  sites  3-5  (lane  2)  showed  major  fragments  within  the  mixture  at  75bp  and 
~85bp,  respectively.  Ligation  of  fragments  representing  sites  1-5  produces  a  ~1 75bp 
fragment  (marked  with  an  arrowhead  in  lane  2;  also  present  in  lane  4)  while  ligation  of 
fragments  representing  parts  1-4  produces  a  faint  slightly  smaller  fragment  (lane  1). 
LMW  represents  molecular  weight  marker  bands  of  size  (starting  from  bottom)  25bp,  50, 
75,  100,  150,  200  (slightly  brighter),  250,  300,  400,  500,  766bp.  PCR  of  the  ligation 
mixture  from  lane  2  with  primers  representing  sequences  outside  the  site  1  and  site  5 
(3 ’PCR  and  5’ PCR)  produces  two  bands  at  150  and  ~175bp  (lane  6).  When  this  DNA  is 
cut  with  BamHI  and  Hindlll  (situated  just  inside  the  primer  sites),  a  strong  ~1  lObp  band 
is  produced  (marked  with  arrow  in  lane  5).  This  is  the  expected  size  for  a  band  to 
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represent  5  sites  of  20bp  each  having  extra  12  bp  to  represent  the  two  restriction  sites. 
This  digested  band  was  cloned  into  the  vector  pBluescript  for  sequencing. 


4.11  Analysis  of  DNA  library 


Library  strands  were  cloned  to  check  whether  representative  sequences  were  present.  For 
cloning  and  sequencing  of  these  library  DNAs,  amplified  product  was  cut  with  BamHI 
and  Hindlll  and  purified  the  fragment  on  a  gel  (Figure  12,  lane  5)  where  the  expected 
~1  lObp  product  was  formed.  A  number  of  colonies  were  obtained  and  6  were  sequenced. 
Table  2  contains  the  sequences  of  the  clones  and  the  site  identifications  for  each.  The  6 
sequences  obtained  represented  unique  different  fragments  of  the  predicted  library  with 
one  clone  have  a  mutation  in  the  T3  site  (clone  A)  and  one  clone  have  a  single  extra  base 
at  one  end  (clone  5).  These  changes  could  have  been  present  in  the  oligonucleotides  used 
to  produce  the  library  or  could  have  been  introduced  by  the  Taq  polymerase  that  was 
used  to  amplify  the  library  before  cloning.  Finding  only  these  unique  fragments 
representing  individually  all  the  possible  sites  (F  or  T  at  each  of  the  5  sites),  is  consistent 
with  the  hypothesis  that  we  have  obtained  all  32  individual  members  of  the  library  during 
this  construction  process 


Clone 

name 

Clone  sequence  (site  5->site  1  direction) 

Site  designations 

E 

ATGTGTTGAGGTGGTGGAAA- 
TGAAGGAAGTGAGAGTGAGG- 
AT  GAGGT  G  ATGGAGGAAAGG- 
AAAGGTGTAGGTGGTGTAGG- 
GTTGGTAGAGTGGTTGGTTG 

FI  F2  T3  T4  F5 

G 

GGTTTTGAGTGGAGAGGAGA- 
TGGAAGAAGT  GAGAGAGGT  G- 
AT  GAGGT  G  ATGGAGGAAAGG- 
TGATTGTGGAGGAGTTGTGT- 
GTTGGAAAGTGGAGGTTTGG 

T1  T2  T3  F4  T5 

A 

ATGTGTTGAGGTGGTGGAAA- 
TGGAAGAAGT  GAGAGAGGT  G- 
AATTGTGTGTGTGCGTGTGA- 
AAAGGTGTAGGTGGTGTAGG- 
GTTGGAAAGTGGAGGTTTGG 

T1  F2  F3*  F4  F5  (has  a 

T^C  mutation  in  F3 
sequence) 

5 

ATGTGTTGAGGTGGTGGAAA- 
TGAAGGAAGTGAGAGTGAGG- 
AT  GAGGT  G  ATGGAGGAAAGG- 
TGATTGTGGAGGAGTTGTGT- 
GTTGGAAAGT  GGAGGNTT  GA 

T1  T2  T3  T4  F5  (has  an 
extra  A  at  the  end  of  Tl) 

4 

GGTTTTGAGTGGAGAGGAGA- 
TGGAAGAAGT  GAGAGAGGT  G- 
AATTGTGTGTGTGTGTGTGA- 
TGATTGTGGAGGAGTTGTGT- 
GTTGGT  AGAGT  GGTT  GGTTG 

FI  T2  F3  F4  T5 

F 

ATGTGTTGAGGTGGTGGAAA- 
TGGAAGAAGT  GAGAGAGGT  G- 
AT  GAGGT  G  ATGGAGGAAAGG- 
TGATTGTGGAGGAGTTGTGT- 
GTTGGAAAGTGGAGGTTTGG 

Tl  T2  T3  F4  F5 

Figure  13:  Sequences  of  individual  clones  from  DNA  library 
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5.  Conclusions  and  future  work 


5.1  DNA  networks 

The  fact  that  simple  network  motifs  can  be  discovered  using  DNA  has  been 
demonstrated.  Future  work  entails  algorithm  development  to  find  more  complex  motifs. 


5.2  Tagged  DNA 

DNA  fragment  strands  were  successfully  labeled  with  4  different  molecules,  and  it  was 
shown  that  the  antibodies  to  the  molecular  tags  were  indeed  specific  to  those  tags.  The 
process  of  perfecting  protocols  to  select  tagged  DNA  using  tag  antibodies  continues.  The 
highest  yielding  method  is  expected  to  be  the  permanent  linkage  of  the  antibody  to  the 
magnetic  beads.  This  method  should  reduce  the  noise  in  the  detection  procedure  when 
the  antibody  is  still  present. 


5.3  DNA  library  construction,  analysis  and  selection 

A  5-site  2-variable  DNA  library  using  sequences  generated  by  SynDCode  was  created. 
Statistical  sampling  from  this  collection  indicates  that  the  library  is  likely  complete  as  all 
fragments  were  unique  members  of  this  32-member  library.  Work  on  protocols  for 
detection  of  specific  probe  members  using  complements  immobilized  on  membranes 
continues. 
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